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DRUG TARGETS IN CANDIDA ALBICANS 

The present invention is concerned with the 
identification of genes or functional fragments 
thereof from Candida albicans which are critical for 
growth and cell division and which genes may be used 
as selective drug targets to treat Candida albicans 
associated infections. Novel nucleic acid sequences 
from Candida albicans are also provided and which 
encode the polypeptides which are critical for growth 
of Candida albicans . 

Opportunistic infections in immunocompromised 
hosts represent an increasingly common cause of 
mortality and morbidity. Candida species are among 
the most commonly identified fungal pathogens 
associated with such opportunistic infections, with 
Candida albicans being the most common species. Such 
fungal infections are thus problematical in, for 
example, AIDS populations in addition to normal 
healthy women where Candida albicans yeasts represent 
the most common cause of vulvovaginitis . 

Although compounds do exist for treating such 
disorders, such as for example, amphotericin, these 
drugs are generally limited in their treatment because 
of their toxicity and side effects. Therefore, there 
exists a need for new compounds which may be used to 
treat Candida associated infections in addition to 
compounds which are selective in their action against 
Candida albicans. 

Classical approaches for identifying anti- fungal 
.compounds have relied almost exclusively on inhibition 
of fungal or yeast growth as an endpoint . Libraries 
of natural products, semi -synthetic, or synthetic 
chemicals are screened for their ability to kill or 
arrest growth of the target pathogen or a related 
nonpathogenic model organism. These tests are 
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cumbersome and provide no information about a 
compounds mechanism of action. The promising lead 
compounds that emerge from such screens must then be 
tested for possible host -toxicity and detailed 
5 mechanism of action studies must subsequently be 

conducted to identify the affected molecular target. 

The present inventors have now identified a range 
of nucleic acid sequences form Candida albicans which 
encode polypeptides which are critical for its 
10 survival and growth. These sequences represent novel 
targets which can be incorporated into an assay to 
selectively identify compounds capable of inhibiting 
expression of such polypeptides and their potential 
use in alleviating diseases or conditions associates 
15 with Candida albicans infection. 

Therefore, according to a first aspect of the 
invention there is provided a nucleic acid molecule 
encoding a polypeptide which is critical for survival 
and growth of the yeast Candida albicans and which 
nucleic acid molecule comprises any of the sequences 
of nucleotides in Sequence ID Numbers 1, 2, 3, 5, 10 
11, 12, 14, 16, 18, 20, 21, 23, 25, 27, 29, 31, 33, 
37, 39, 41, 44, 45, 46, 49, 50, 52, 55, 57, 59, 61,' 
63, 65, 67, 70, 72, 74, 76, 78, 80, 81, 83, 85, 87, 
89, 91, 93, 95, 97, 99, 101, 104, 106, 108, 110 and 
113, or the sequences of nucleotides identified in 
Figures 9 to 13 . 

Whilst the molecules defined herein have been 
established as being critical for growth and 
metabolism of Candida albicans, for some of the 
molecules no apparent functionality has been assigned 
by virtue of the fact that no functionally related 
sequences in other prokaryotic or eukaryotic organism 
can be found in respective databases. Thus, 
advantageously these sequences may be species specific 
in which case they may be used be used as selective 
targets for treatment of diseases mediated by Candida 
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Albicans infection. Thus, in one aspct of the 
invention the nucleic acid molecules preferably 
comprise the sequences identified in sequence ID Nos 
1. 2, 3, 5, 10, 11, 12, 14, 16, 17, 18, 46, 49 50 
52, 55, 57, 59, 61, 63, 65, 87, 89, 91, 93, 95' 97' 
99, 101, 104, 106, 108, and 110 and the corresponding 
polypeptide sequences identified in Table 1. 

Some of sequences according to invention have 
been assigned a particular function. Nucleic acid 
molecules according to this aspect of the invention 
comprise any of the sequences as described in sequence 
ID Nos, 20, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 
41, 45, 65, 70, 72, 74, 76, 78, 80, 81, 83, 85 and' 113 
and the corresponding polypeptides identified in Table 

Letters utilised in the nucleic acid 
sequences according to the invention to represent the 
genetic code and which are not recognisable as letters 
of the genetic code signify a position in the nucleic 
acid sequence where one or more of bases A, G, C or T 
can occupy the nucleotide position. Representative 
ambiguity codes used to identify the range of bases 
which can be used are as follows: 



M: 


A 


or 


C 






R: 


A 


or 


G 






W: 


A 


or 


T 






S: 


C 


or 


G 






Y: 


C 


or 


T 






K: 


G 


or 


T 






V: 


A 


or 


C 


or 


G 


H: 


A 


or 


C 


or 


T 


D: 


A 


or 


G 


or 


T 


B: 


C 


or 


G 


or 


T 


N: 


G 


or 


A 


or 


T or C 



In one embodiment of the above identified 



asDects 
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of the invention the nucleic acid may comprise a mRNA 
molecule or alternatively a DNA and preferably a cDNA 
molecule . 

Also provided by the present invention is a 
5 nucleic acid molecule capable of hybridising to the 
nucleic acid molecules according to the invention 
under high stringency conditions, such as for example, 
an antisense molecule. 

Stringency of hybridisation as used herein refers 
0 to conditions under which polynucleic acids are 

stable. The stability of hybrids is reflected in the 
melting temperature (Tm) of the hybrids. Tm can be 
approximated by the formula: 



81.5°C + 16.6 (log 10 (Na*] + 0.41 (%G&C) -6 



001/1 



wherein 1 is the length of the hybrids in nucleotides. 
Tm decreases approximately by 1-1.5°C with every 1% 
decrease in sequence homology. 

The nucleic acid capable of hybridising to 
nucleic acid molecules according to the invention will 
generally be at least 70%, preferably at least 80 or 
90% and more preferably at least 95 to 97% homologous 
to the nucleotide sequences according to the 
invention. 

The DNA molecules according to the invention may, 
advantageously, be included in a suitable expression 
vector to express polypeptides encoded therefrom in a 
suitable host. 

The present invention also comprises within its 
s.cope proteins or polypeptides encoded by the nucleic 
acid molecules according to the invention or a 
functional equivalent, derivative or bioprecursor 
thereof . 

Therefore, according to a further aspect of the 
invention there is provided a polypeptide which is 
•critical for the growth and survival of Candida 
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albicans comprising an amino acid sequence of any of 
Sequence ID Numbers 4, 6 to 9. 13. is, 19 , 22 J 
*B. 30. 32. 34. 36. 38, 40. 42, 43, 47, 48, si S3 
54, 56, 58, 60. 62, 64, 66, 68. 69. 71, 73, 75 ' 77' 
™. 82. 84. 86, 88, 90, 92, 94, 96, 98, 1 00 , 1 02 ^ 
105. 107. 109. Ill, n 2 , 114 or che seguences 102 ' 103 " 
llliustrated in Figures 14 or 15. 

An expression vector according to the invention 
includes a vector having a nucleic acid according to 
the invention operably linked to regulatory sequences 
such as promoter regions, that are capable of 
effecting expression of said DNA fragments. The term 
"operably linked" refers to a juxta position wherein 
the components described are in a relationship 
permitting them to function in their intended manner 
Such vectors may be transformed into a suitable host' 
cell to provide for expression of a polypeptide 
according to the invention. Thus, in a further 
aspect, the invention provides a process for preparing 
polypeptides according to the invention which 
comprises cultivating a host cell, transformed or 
transfected with an expression vector as described 
above under conditions to provide for expression by 
the vector of a coding sequence encoding the 
polypeptides, and recovering the expressed 
polypeptides. 

The vectors may be, for example, plasmid, virus 
or phage vectors provided with an origin of 
replication, optionally a promoter for the expression 
of said nucleotide and optionally a regulator of the 
promoter. The vectors may contain one or more 
selectable markers, such as, for example, ampicillin 
resistance. 

Polynucleotides according to the invention may be 
inserted into the vectors described in an antisense 
orientation in order to provide for the production of 
antisense rna. Antisense RNA or other antisense 
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nucleic acids may be produced by synthetic means. 

In accordance with the present invention, a 
defined nucleic acid includes not only the identical 
nucleic acid but also any minor base variations 
including in particular, substitutions in bases which 
result in a synonymous codon (a different codon 
specifying the same amino acid residue) due to the 
degenerate code in conservative amino acid - 
substitutions. The term "nucleic acid sequence" also 
includes the complementary sequence to any single 
stranded sequence given regarding base variations. 

The present invention also advantageously 
provides nucleic acid sequences of at least 
approximately 10 contiguous nucleotides of a nucleic 
acid according to the invention and preferably from 10 
to 50 nucleotides. These sequences may, 
advantageously be used as probes or primers to 
initiate replication, or the like. Such nucleic acid 
sequences may be produced according to techniques well 
known in the art, such as by recombinant or synthetic 
means. They may also be used in diagnostic kits or 
the like for detecting the presence of a nucleic acid 
according to the invention. These tests generally 
comprise contacting the probe with the sample under 
hybridising conditions and detecting for the presence 
of any duplex or triplex formation between the probe 
and any nucleic acid in the sample. 

According to the present invention these probes 
may be anchored to a solid support. Preferably, they 
are present on an array so that multiple probes can 
simultaneously hybridize to a single biological 
sample. The probes can be spotted onto the array or 
synthesised in situ on the array. (See Lockhart et 
al., Nature Biotechnology, vol. 14, December 1996 
"Expression monitoring by hybridisation to high 
density oligonucleotide arrays" . A single array can 
contain more than 100, 500 or even 1,000 different 
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probes in discrete locations. 

Advantageously, the nucleic acid sequences, 
according to the invention may be produced using such 
recombinant or synthetic means, such as for example, 
5 using PCR cloning mechanisms which generally involve 
making a pair of primers, which may be from 
approximately 10 to 50 nucleotides to a region of the 
gene which is desired to be cloned, bringing the 
primers into contact with mRNA, cDNA, or genomic DNA 
10 from a human cell, performing a polymerase chain 
reaction under conditions which bring about 
amplification of the desired region, isolating the 
amplified region or fragment and recovering the 
amplified DNA. Generally, such techniques as defined 
15 herein are well known in the art, such as described in 
Sambrook et al (Molecular Cloning: a Laboratory 
Manual, 1989) . 

The nucleic acids or oligonucleotides according 
to the invention may carry a revealing label . 
20 Suitable labels include radioisotopes such as "p or 
39 S, enzyme labels or other protein labels such as 
biotin or fluorescent markers. such labels may be 
added to the nucleic acids or oligonucleotides of the 
invention and may be detected using known techniques 
25 per se. 

The polypeptide or protein according to the 
invention includes all possible amino acid variants 
encoded by the nucleic acid molecule according to the 
invention including a polypeptide encoded by said 

30 molecule and having conservative amino acid changes. 
Polypeptides according to the invention further 
include variants of such sequences, including 
naturally occurring allelic variants which are 
substantially homologous to said polypeptides. In 

35 this context, substantial homology is regarded as a 

sequence which has at least 70%, preferably 80 or 90% 
amino acid homology with the polypeptides encoded by 



WO 00/09695 



PCT/EP99/05991 



- 8 - 

the nucleic acid molecules according to the invention. 

A nucleic acid which is particularly advantageous 
is one comprising the sequences of nucleotides 
according to Seq ID Nos l and 91 in which are specific 
5 to Candida albicans with no functionally related 

sequences in other prokaryotic or eukaryotic organism 
as yet identified from the respective genomic 
databases . 

Nucleotide sequences according to the invention 
.0 are particularly advantageous for selective 

therapeutic targets for treating Candida albicans 
associated infections. For example, an antisense 
nucleic acid capable of binding to the nucleic acid 
sequences according to the invention may be used to 
5 selectively inhibit expression of the corresponding 
polypeptides, leading to impaired growth of the 
Candida albicans with reductions of associated 
illnesses or diseases. 

The nucleic acid molecule or the polypeptide 

0 according to the invention may be used as a 
medicament, or in the preparation of a medicament, for 
treating diseases or conditions associated with 
Candida albicans infection. 

Advantageously, the nucleic acid molecule or the 
5 polypeptide according to the invention may be provided 
in a pharmaceutical composition together with a 
pharmaceutical ly acceptable carrier, diluent or 
excipient therefor. 

Antibodies to the protein or polypeptide of the 
) present invention may, advantageously, be prepared by 
techniques which are known in the art. For example, 
polyclonal antibodies may be prepared by inoculating a 
host animal, such as a mouse, with the polypeptide 
according to the invention or an epitope thereof and 

1 recovering immune serum. Monoclonal antibodies may be 
prepared according to known techniques such as 
described by Kohler R. and Milstein C, Nature 
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(1975)256, 495-497. 

Antibodies according to the invention may also be 
used in a method of detecting for the presence of a 
polypeptide according to the invention, which method 
5 comprises reacting the antibody with a sample and 

identifying any protein bound to said antibody. a kit 
may also be provided for performing said method which 
comprises an antibody according to the invention and 
means for reacting the antibody with said sample. 
10 Proteins which interact with the polypeptide of 

the invention may be identified by investigating 
protein-protein interactions using the two-hybrid 
vector system first proposed by Chien et al (1991) . 
This technique is based on functional 
15 reconstitution in vivo of a transcription factor which 
activates a reporter gene. More particularly the 
technique comprises providing an appropriate host cell 
with a DNA construct comprising a reporter gene under 
the control of a promoter regulated by a transcription 
2 0 factor having a DNA binding domain and an activating 

domain, expressing in the host cell a first hybrid DNA 
sequence encoding a first fusion of a fragment or all 
of a nucleic acid sequence according to the invention 
and either said DNA binding domain or said activating 
25 domain of the transcription factor, expressing in the 
host at least one second hybrid DNA sequence, such as 
a library or the like, encoding putative binding 
proteins to be investigated together with the DNA 
binding or activating domain of the transcription 
factor which is not incorporated in the first fusion; 
detecting any binding of the proteins to be 
investigated with a protein according to the invention 
by detecting for the presence of any reporter gene 
product in the host cell; optionally isolating second 
35 hybrid DNA sequences encoding the binding protein. 



30 



An example of such a technique utilises the GAL4 
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protein in yeast. GAL4 is a transcriptional activator 
of galactose metabolism in yeast and has a separate 
domain for binding to activators upstream of the 
galactose metabolising genes as well as a protein 
5 binding domain. Nucleotide vectors may be 

constructed, one of which comprises the nucleotide 
residues encoding the DNA binding domain of GAL4 . 
These binding domain residues may be fused to a known 
protein encoding sequence, such as for example the 
10 nucleic acids according to the invention. The other 
vector comprises the residues encoding the protein 
binding domain of GAL4 . These residues are fused to 
residues encoding a test protein. Any interaction 
between polypeptides encoded by the nucleic acid 
according to the invention and the protein to be 
tested leads to transcriptional activation of a 
reporter molecule in a GAL- 4 transcription deficient 
yeast cell into which the vectors have been 
transformed. Preferably, a reporter molecule such as 
{J-galactosidase is activated upon restoration of 
transcription of the yeast galactose metabolism genes. 

Further provided by the present invention is one 
or more Candida albicans cells comprising an induced 
mutation in the DNA sequence encoding the polypeptide 
25 according to the invention. 

A further aspect of the invention provides a 
method of identifying compounds which selectively 
inhibit or interfere with the expression, or the 
functionality of polypeptides expressed from the 
nucleotides sequences according to the invention or 
the metabolic pathways in which these polypeptides are 
involved and which are critical for growth and 
survival of Candida albicans, which method comprises 
(a) contacting a compound to be tested with one or 
more Candida albicans cells having a mutation in a 
nucleic acid molecule according to the invention which 
mutation results in overexpression or underexpression 
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of said polypeptides in addition to one or more wild 
type Candida cells, (b) monitoring the growth and/or 
activity of said mutated cell compared to said wild 
type wherein differential growth or activity of said 
one or more mutated Candida cells provides an 
indication of selective action of said compound on 
said polypeptide or another polypeptide in the same or 
a parallel pathway. 

Compounds identifiable or identified using the 
method according to the invention, may advantageously 
be used as a medicament, or in the preparation of a 
medicament to treat diseases or conditions associated 
with Candida albicans infection. These compounds may 
also advantageously be included in a pharmaceutical 
composition together with a pharmaceutically 
acceptable carrier, diluent or excipient therefor. 

A further aspect of the invention provides a 
method of identifying DNA sequences from a cell or 
organism which DNA encodes polypeptides which are 
critical for growth or survival, which method 
comprises (a) preparing a cDNA or genomic library from 
said cell or organism in a suitable expression vector 
which vector is such that it can either integrate into 
the genome in said cell or that it permits 
transcription of antisense RNA from the nucleotide 
sequences in said cDNA or genomic library, (b) 
selecting transf ormants exhibiting impaired growth and 
determining the nucleotide sequence of the cDNA or 
genomic sequence from the library included in the 
vector from said transf ormant . Preferably, the cell 
or organism may be any yeast or filamentous fungi, 
such as for example, Saccharomyces cervisiae, 
Saccharomyces pombe or Candida albicans. 

A further aspect of the invention provides a 
pharmaceutical composition comprising a compound 
according to the invention together with a 
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pharmaceutical^ acceptable carrier, diluent or 
excipient therefor. 

The present invention may be more clearly 
understood with reference to the accompanying example, 
which is purely exemplary, with reference to the 
accompanying drawings wherein: 

Figure 1: ± s a n illustration of A) 

Intergration of the antisense 
library plasmid (here shown as a 
linear fragment) at a site (eg. 
GAL1 promoter region) within the 
genome which is non- homologous to 
the insert DNA. As a result the 
GALlp region is duplicated and 
antisense RNA can be formed from 
GENE X upon induction of GALlp, 
and B) Intergration due to 
homologous recombination of the 
gene insert (GENE X) of an 
antisense library clone (here 
shown as a linear fragment) with 
the homologous gene (gene x) 
within the Candida genome. As a 
result this gene is duplicated. 
The first copy of the gene geNE X, 
is flanked by upstream its 
endogenous promoter and 
downstream, oppositely-oriented, 
the GAL2 promoter resulting in a 
so-called "collision construct". 
Antisense RNA can be formed from 
GENE X upon induction of GALlp. 
The second copy of the gene, GEne 
X, is devoid of a promoter and 
will not be transcribed. 
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Figure 4 : 
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Figures 6 : 
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is an illustration of the vectors 
used for the preparation of a cDNA 
antisense library, pGALlPNiST-1, 
(left) and a genomic library, 
pGALlPNiST-1 (right) . 

Growth curves in S-glucose and S- 
galactose medium of respectively 
the wild type CAI-4 strain and two 
transformants (clone 3 6 and 38) 
showing antisense induced 
reduction in growth and overall 
impaired growth, respectively. 
Growth curves in S-glucose+maltose 
and S-galactose+maltose medium of 
respectively the wild type CAI-4 
strain and transformants resulting 
from antisense library 
transformation . 

is an illustration of promoter 
activity of the C. albicans GAL1 
promoter in the absence and 
presence of maltose as a carbon 
source . 

is a Northern blot analysis of C. 
albicans mRNA in wild type and 
clone 3 6 using a SAM2 and a TEF3 
specific probe. 

is A) a Northern blot analysis of 
sequences of C. albicans mRNA in 
wild type and clone 3 8 using a 
RNRl and an ACT1 specific probe; 
and B) Real Time Quantitative PCR 
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Figure 8 : 



Figure 9 : 



Figure 10: 



Figure 11: 



Figure 12 : 



Figure 13 : 



Figure 14 : 



Figure 15: 
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on C. albicans mRNA in wild 
type and clone 3 8 using a 
RNR1 and ACT1 specific 
f luorogenic probe . 

is a nucleotide sequence of 
plasmid pGALlPNiST- 1 . 

is a nucleotide sequence of 
plasmid pGALlPSiST-1 . 

is a nucleotide sequence of clone 
3 8 which has been assigned RNR1 
functionally. 

is a nucleotide sequence of clone 
113g4. 

is a nucleotide sequence of clone 
207g4 

is a nucleotide sequence of clone 
66g4 . 

is a nucleotide sequence of clone 
3 6 which has been assigned Sam2 
functionally. 

is an amino acid sequence of clone 
38. 

is an amino acid sequence of clone 
36. 
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Figures 16 to 70 
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Figure 71 

10 
15 
20 

Figure 72 

25 
30 

Figure 73 



are growth curves of Candida 
albicans showing antisense induced 
reduction in growth by inhibition 
of molecules according to the 
invention . 

is an illustration of the results 
obtained in a Northern blot 
analysis of a FAL1 single allele 
knock-out . 

Lane 1: RNA MWM I (Boerhinger 

Mannheim) 
Lane 2: WT + gal + mal + LiAc 
Lane 3: FAL1 + gal + mal + LiAc 
Lane 4: RNA MWM I DIG labelled 

(Boerhinger Mannheim) 
Lower FAL1 transcript levels are 
observed in the FAL1 single allele 
knock-out strain compared to the 
wild type strain. 

is an illustration of growth 
curves obtained of a FAL1 single 
allele knock-out strain and wild 
type. The FAL1 single allele 
knock-out grows equal to the wild 
type. However, it is 
significantly more resistant to 
Hygromycin B. 

is an illustration of the results 
of a Northern blot obtained using 
a RNR1 single allele knock-out. 
Correct and single integration of 
RNR1 disruption cassette was 
confirmed by PCR and Southern blot 
analysis. Lower RNR1 transcript 
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levels are observed in the RNR1 
single allele knock-out strain 
compared to the wild type strain. 
This result was confirmed by 
quantitative PCR (QT-PCR) . 

is an illustration of the growth 
curves obtained from various RNR1 
single allele knock-out strains. 
The RNR1 single allele knock-out 
shows an extended LAG phase 
compared to wild type. 

is an illustration of the growth 
curves obtained from a single SAM2 
knock-out clone against wild type. 
Inoculum for SAM2 was somewhat 
higher; at equal inocula, growth 
of SAM2 single allele knock-out is 
slightly higher. 

is an illustration of growth 
curves obtained in a RHOl single 
allele knock-out. Growth of the 
RHOl single allele knock-out is 
impaired compared to wild type 
growth . 

is an illustration of the results 
obtained from an RNA level 
analysis for a MEG1 single allele 
knock-out. MEG1 expression was 
decreased more than 14 fold in the 
MEG1 single allele knock-out 
compared to the WT . 



Figure 78 



is an illustration of the results 
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obtained in a growth curve 
analysis of MEG1 single allele 
knock-out compared to wild type. 
Inoculum for SAM2 was somewhat 
higher; at equal inocula growth of 
SAM2 single allele knock-out is 
slightly lower. 

is an illustration of the results 
obtained from an RNA level QT-PCR 
analysis on a MAA single allele 
knock-out. Correct and single 
integration of MAA disruption 
cassette was confirmed by both PCT 
and Southern blot analysis. MAA 
expression was decreased two fold 
in MAA knock-out compared to wild 
type. 

is an illustration of the growth 
curves obtained in a single allele 
knock-out MAA clone compared to 
wild type. Inoculum for MAA was 
somewhat higher; at equal inocula 
growth of MAA single allele knock- 
out is slightly slower. 

is an illustration of the results 
obtained from a RNA level QT-PCR 
analysis on a RPL27 single allele 
knock-out. RPL27 expression was 
decreased more than three fold in 
the RPC27 knock-out compared to 
wild type. 

is an illustration of the growth 
curves obtained from a RPL27 
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single allele knock-out clone 
compared to wild type. The RPL27 
single allele knock-out strain 
grows equally to the wild type 
5 strain. 

Identification of novel drug targets in C. 
10 albicans by anti-sense and disruptive integration 
The principle of the approach is based on the 
fact that when a particular C. albicans mRNA is 
inhibited by producing the complementary anti-sense 
RNA, the corresponding protein will decrease. If this 
15 protein is critical for growth or survival, the cell 
producing the anti-sense RNA will grow more slowly or 
will die. 

Since anti-sense inhibition occurs at mRNA level, 
the gene copy number is irrelevant, thus allowing 
20 applications of the strategy even in diploid 
organisms . 

Anti-sense RNA is endogenously produced from an 
integrative or episomal plasmid with an inducible 
promoter; induction of the promoter leads to the 

25 production of a RNA encoded by the insert of the 

plasmid. This insert will differ from one plasmid to 
another in the library. The inserts will be derived 
from genomic DNA fragments or from cDNA to cover-to 
the extent possible- the entire genome. 

30 The vector is a proprietary vector allowing 

integration by homologous recombination at either the 
homologous insert or promoter sequence in the Candida 
genome. After introducing plasmids from cDNA or 
genomic libraries into C. albicans, transf ormant s are 

35 screened for impaired growth after promoter (& thus 
anti-sense) induction in the presence of lithium 
acetate. Lithium acetate prolongs the Gl phase and 
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thus allows anti-sense to act during a prolonged 
period of time during the cell cycle. Trans formants 
which show impaired growth in both induced and non- 
induced media, thus showing a growth defect due to 
5 integrative disruption, are selected as well. 

Transf ormants showing impaired growth are 
supposed to contain plasmids which product anti-sense 
RNA or mRNAs critical for growth or survival. Growth 
is monitored by measuring growth-curves over a period 

10 of time in a device (Bioscreen Analyzer, Labsystems) 

which allows simultaneous measurement of growth-curves 
of 200 transf ormants . 

Subsequently plasmids can be recovered from the 
transf ormants and the sequence of their inserts 

15 determined, thus revealing which mRNA they inhibit. In 
order to be able to recover the genomic or cDNA insert 
which has integrated into the Candida genome, genomic 
DNA is isolated, cut with an enzyme which cuts only 
once into the library vector (and estimated approx. 

20 every 4096 bp in the genome) and relegated. PCR with 
primers flanking in the insert will yield (Partial) 
genomic or cDNA inserts as PCR fragments which can 
directly be sequenced. This PCR analysis (on ligation 
reaction) will also show us how many integrations 

25 occurred. Alternatively the ligation reaction is 

transformed to E. coli and PCR analysis is performed 
on colonies or on plasmid DNA derived thereof. 

This method is employed for a genome wide search 
for novel C. albicans genes which are important for 

30 growth or survival. 

MATERIALS AND METHODS 
Construction of pGallNIST-1 

35 



pGALlPNiST-1 (integrative antisense Sfil-NotI vector) 
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was constructed as described by Logghe et al. , 
submitted . 

Construction of pGALlPSiST-1 

5 

The vector pGALlPSiST-1 (integrative Sfil-Sfil vector) 
was created for cloning the small genomic DNA 
fragments behind the GAL1 promoter. The only 
difference with pGALlPNiST-1 is that the hlFNb insert 

10 fragment in pGALlPSiST-1 is flanked by two Sfil sites 
instead of a Sfil and a NotI site as in pGALlPNiST-1 . 
To construct pGALlPSiST-1 the EcoRI-Hindlll fragment, 
containing hlFNb flanked by a Sfil and a NotI site, of 
pMAL2pHiET-3 (Logghe M. , unpublished) was exchanged by 

15 the EcoRI-Hindlll fragment, containing hlFNb flanked 
by two Sfil sites, from YCp50S-S (an E. coli / S. 
cerevisiae shuttle vector derived from the plasmid 
YCp50, which is deposited in the ATCC collection 
(number 37419; Thrash et al., 1985); an EcoRI-Hindlll 

20 fragment, containing the gene hlFNb, which is flanked 
by two Sfil sites, was inserted in YCp50, creating 
YCp50S-S), resulting into plasmid pMAL2PSiST-l . The 
MAL2 promoter from pMAL2PSiST-l (by a Nael-Fspl 
digest) was further replaced by the GAL1 promoter from 

25 pGALlPNiST-1 (via a Xhol-Sall digest) , creating the 
vector pGALlPSiST-1 . 

Preparation of C . albicans genomic library 

30 AC. albicans genomic DNA library with small DNA 

fragments was prepared for integrative disruption. 
Genomic DNA of C. albicans B2630 (ATCC No. 44858) was 
isolated following a modified protocol of Blin and 
Stafford (1976) . To obtain enrichment for genomic DNA 

35 fragments of the desired size, the genomic DNA was 

partially digested. Enrichment of small DNA fragments 
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was obtained with 70 units of Alul on 10 mg of genomic 
DNA for 20 min. T4 DNA polymerase (Boehringer) and 
dNTPs (Boehringer) were added to polish the DNA ends. 
After extraction with phenol-chloroform the digest was 
5 size-fractionated on an agarose gel. The genomic DNA 

fragments with a length of 0.5 to 1.25 kb were eluted 
from the gel by centrifugal filtration (Zhu et al., 
1985). Sfil adaptors (5* GTTGGCCTTTT) were attached to 
the DNA ends (blunt) to facilitate cloning of the 

10 fragments into the vector. After ligation of these 
adaptors to the DNA fragments a second size- 
f ractionation was performed on an agarose gel. The 
small genomic DNA fragments were cloned upstream of 
the GAL1 promoter in the vector pGALlPSiST-1 . Qiagen- 

15 purified pGALlPSiST-1 plasmid DNA was digested with 
Sfil and the largest vector fragment eluted from the 
gel by centrifugal filtration (Zhu et al., 1985). The 
ligation mix was electroporated to MC1061 (...) E. coli 
cells . 

20 

C. albicans cDNA library 

Total RNA was extracted from C. albicans strain B2630 
grown on respectively minimal (SD) and rich (YPD) 

25 medium as described by Sambrook et al. (1989) . mRNA 

was prepared from total RNA using the Invitrogen Fast 
Track procedure. First strand cDNA was synthesised 
with Superscript Reverse Transcriptase (BRL) and with 
an oligo dT-Notl Primer adapter. After second strand 

30 synthesis, cDNA was polished with Klenow enzyme and 
purified over a Sephacryl S-400 spin column. 
Phosphorylated Sfil adapters were then ligated to the 
cDNA, followed by digestion with the NotI restriction 
enzyme. The Sfil/NotI cDNA was purified and sized on a 

35 Biogel column A150M. cDNA was ligated in a Notl/Sfil 
opened pGALlPNiST-1 vector. 
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Transformation of C . albicans 

C. albicans CAI-4 (URA3 : : imm4 34 /URA3 : : imm434 ) was 
kindly provided by Dr. William Fonzi, Georgetown 
5 University (Fonzi and Irwin, 1993) . CAI-4 was 

transformed with above described cDNA library or 
genomic library using a modified spheroplast method 
(Logghe M., submitted). Cells were plated on minimal 
medium supplemented with glucose and sorbitol (SD 
10 (0.67% Yeast Nitrogen base w/o amino acids + 2% 

glucose), 1 M sorbitol) plates using 0.4 cm glass- 
pearls (Glaverbel, Belgium) and incubated for 2-3 days 
at 30°C. 

15 Screening for mutants 

Starter cultures were set up by inoculating each 
colony in 1 ml SD medium and incubating overnight at 
30°C and 300 rpm. Cell densities were determined using 

20 a Coulter counter (Coulter Zl; Coulter electronics 
limited). 250.000 cells/ml were inoculated in SD 
medium for a total volume of 1ml and cultures were 
incubated for 24 hours at 30°C and 300 rpm. Cultures 
were washed in minimal medium without glucose (S) and 

25 the pellet resuspended in 650 ml S medium. 8 pi of 

this culture was used for inoculating 400 \il cultures 
in a Honeywell-100 plate (Bioscreen analyzer, 
Labsystems) . Each transformant was grown for three 
days in S medium containing 50 mM LiAc; pH 6.0, with 

30 2% glucose +/- 2% maltose or 2% galactose +/- 2% 

maltose respectively while shaking (high intensity) 
every 3 minutes for 20 seconds. Optical densities were 
measured every hour and growth curves were generated 
automatically (Bioscreen analyzer; Labsystems) . 



35 



- 23 - 



Construction of LAC 4/ pGALlPNiST-1 

pGALlPNiST-1 vector was cut with StuI in order to 
release the stuffer fragment and subsequently 
5 dephosphorylated (CIP, Boehringer) . Plasmid pRS1004, 
obtained from J. Ernst (University of Duesseldorf, 
Germany) , was cut with PvuII/Xbal in order to release 
the K. lactis fl-galactosidase (EC 3.2.1.23; LAC 4 ) 
reporter gene and Klenow-treated. The LAC4 PvuII/Xbal 
10 blunted reporter gene fragment from pRS1004 was 

ligated into StuI opened pGALlPNiST-1 resulting in the 
integrative plasmid LAC4 / pGALlPNiST-1 

Measurement of GALl promoter activity 

15 

C. albicans strain CAI-4 was transformed with 
LAC4/pGALlpNiST-l using the modified spheroplast 
method (Logghe et al., submitted). Resulting 
transf ormants were grown in 5 ml of respectively non- 
20 induction (SD +/- maltose) and induction (S + galactose 
+ /- maltose) medium and further processed as described 
by Leuker et al. (1997) . 

Isolation of genomic or cDNA inserts 

25 

Potentially interesting transf ormants were grown in 
1.5 ml SD overnight. Genomic DNA was isolated using 
the Nucleon MI Yeast kit (Amersham) and the 
concentration of genomic DNA was estimated by 

30 analyzing a sample on a 0.7% agarose gel in 0.5x TBE 
and comparison to a known standard molecular weight 
marker. 20 ng of genomic DNA was digested for three 
hours with an enzyme that cuts uniquely in the library 
vector (SacI for the genomic library; PstI for the 

35 cDNA library) , treated with RNAse A (Boehringer) and 
incubated for 20 minutes at 65°C to inactivate the 
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enzyme. Samples were phenol/chloroform extracted twice 
and precipitated using NaOAc/ethanol . The resulting 
pellet was resuspended in 500 pi ligation mixture (1 x 
ligation buffer and T4 DNA ligase; both from 
5 Boehringer) and incubated overnight at 16°C. 

After denaturation (10 min 65°C) , purification 
(phenol/chloroform extraction) and precipitation 
(NaOAc/ethanol) the pellet was resuspended in 10 pi 
MilliQ (Millipore) water. 

10 Inverse PCR was performed on 1 pi of the precipitated 
ligation reaction using library vector specific 
primers (Figure 1) ( 3pGALSistPCR: 5' GAG-GGC-GTG-AAT- 
GTA-AGC-GTG 3' and 5pGALNistPCR: 5 ' GAG-TTA-TAC-CCT- 
GCA-GCT-CGA-C 3' for the genomic library; 

15 3pGALNist PCR : 5' TGA-GCA-GCT-CGC-CGT-CGC-GC 3 f and 

SpGALN i s t PC R for the cDNA library; all primers from 
Eurogentec) for 30 cycles each consisting of (a) 1 min 
at 95 °C, (b) 1 min at 61 (or 57 °C for the cDNA 
library primers) , and (c) 3 min at 72 °C. In the 

20 reaction mixture 2.5 units of Taq polymerase 

(Boehringer) with TaqStart antibody (Clontech) (1:1) 
were used, and the final concentrations were 0.2 pM of 
each primer, 3 mM MgCl 2 (Perkin Elmer Cetus) and 200 
pM dNTPs (Perkin Elmer Cetus) . All PCR reactions were 

25 performed in a Robocycler (Stratagene) . 

PCR analysis is also performed on genomic DNA isolated 
from the t ransf ormants using primers 3pGALSist PCR and 
SpGALNist PCR for the genomic library transf ormants and 
using primers oligo23' : 5' TGC-AGC-TCG-ACC-TCG-AGG 3' 

30 and oligo25: 5* GCG-TGA-ATG-TAA-GCG-TGA-C 3' (T hybr = 
53 °C) for the cDNA library transf ormants . 
Resulting PCR products were purified using the PCR 
purification kit (Qiagen) and were quantified by 
comparison of band intensity with the intensity of DNA 

35 marker bands on a ethidium bromide stained agarose 
gel . 
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Sequence determination 

The amount of PGR product (expressed in ng) put in the 
sequencing reaction is calculated as the length of the 
5 PCR product in basepairs divided by 10. DNA sequencing 
reactions were performed using the ABI Prism BigDye 
Terminator Cycle Sequencing Ready Reaction Kit 
according to the instructions of the manufacturer (PE 
Applied Biosystems, Foster City, CA) except for the 

10 following modifications. The total reaction volume was 
reduced to 15 pi. Reaction volumes of individual 
reagents were changed accordingly. The 6.0 pi 
Terminator Ready Reaction Mix was replaced by a 
mixture of 3.0 pi Terminator Ready Reaction Mix + 3.0 

15 pi Half Term (GENPAK Limited, Brighton, UK) . After 

cycle sequencing, reaction mixtures were purified over 
Sephadex G50 columns prepared on Multiscreen HV opaque 
Microtiter plates (Millipore, Molsheim, Fr) and were 
dried in a speedVac. Reaction products were 

20 resuspended in 3 pi loading buffer. Following 

denaturation for 2 min at 95°C, 1 pi of sample was 
applied on a 5% Long Ranger Gel (36 cm well-to-read) 
prepared from Singel Packs according to the supplier's 
instructions (FMC BioProducts, Rockland, ME) . Samples 

25 were run for 7 hours 2X run on a ABI 377XL DNA 

sequencer. Data collection version 2.0 and Sequence 
analysis version 3.0 (for basecalling) software 
packages are from PE Applied Biosystems. 

30 Sequence analysis 

Nucleotide sequences were imported in the VectorNTI 
software package (InforMax Inc, North Bethesda, MD, 
USA) , and the vector and insert regions of the 
sequences were identified. Sequence similarity 
35 searches against public and commercial sequence 
databases were performed with the BLAST software 
package (Altschul et al., 1990) version 1.4. Both the 
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original nucleotide sequence and the six-frame 
conceptual translations of the insert region were used 
as query sequences. The used public databases were the 
EMBL nucleotide sequence database (Stoesser et al., 
5 1998), the SWISS-PROT protein sequence database and 
its supplement TrEMBL (Bairoch and Apweiler, 1998), 
and the ALCES Candida albicans sequence database 
(Stanford University, University of Minnesota) . The 
commercial sequence databases used were the LifeSeq® 

10 human and PathoSeq™ microbial genomic databases 

(Incyte Pharmaceuticals Inc., Palo Alto, CA, USA), and 
the GENESEQ patent sequence database (Derwent, London, 
UK) . Three major results were obtained on the basis of 
the sequence similarity searches: function, novelty, 

15 and specificity- A putative function was deduced on 
the basis of the similarity with sequences with a 
known function, the novelty was based on the absence 
or presence of the sequences in public databases, and 
the specificity was based on the similarity with 

20 vertebrate homologues. 

The 5' UTR region of the SAM2 gene was analysed using 
the "Findpatterns" algorithm of the Genetics Computer 
Group (GCG) software package (University of Wisconsin, 
USA) . 

25 

Northern blot analysis 

Cells were grown to OD 600 - 1.0 and total RNA was 
prepared using the RNeasy midi kit (Qiagen) according 
to the manufacturer's instructions. RNA concentrations 

30 were determined spectrophotometrically by measuring 
optical densities at 260 nm in a UV-1601 UV-visible 
spectrophotometer (Shimadzu) and 5 yg of each sample 
was resolved onto a 1% formaldehyde gel and run in 1 x 
formaldehyde gel running buffer ( 5prime-3prime ) at 3.5 

35 V/cm. RNA was stained for 20 minutes using SYBR Green 
II stain (Molecular probes) 1/10000 diluted in lx 
formaldehyde gel running buffer ( 5prime-3prime ) and 
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subsequently transferred to Hybond-N+ nylon membrane 
(Amersham) by overnight capillary blotting in 20 x 
SSC. DIG-labeled probes were prepared using DIG-dUTP 
(Boehringer Mannheim) at a 1:3 or 1:6 dTTP : DIG-dUTP 
5 ratio, 10 pg of template plasmid DNA, lx PCR buffer II 
(Perkin Elmer Cetus) , 10 \iM of each primer 
(Eurogentec), 0.2 mM of dATP, dCTP and dGTP (Perkin 
Elmer Cetus), 2.5 mM MgCl 2 (Perkin Elmer Cetus), 5% 
DMSO and 1.25 units Taq polymerase (Boehringer). 

10 The membrane was prehybridized at 50 °C (DNA probes) or 
at 68 °C (RNA probes) in DIG Easy Hyb (Boehringer 
Mannheim) for minimum 1 hour. Hybridization was 
performed using 1 \il PCR reaction product (= 1/50 of 
the total volume) /ml DIG Easy Hyb. The probes were 

15 denatured by heating the PCR reaction for 10 minutes 
at 96°C, then quick-chilling on ice. The probe was 
kept on ice for 5 minutes, centrifuged briefly and 
diluted in pre-warmed DIG Easy Hyb solution. The 
entire probe solution was filtered through a 0.45 pm 

20 filter (Millex HV, Millipore) prior to use. 
Hybridizations were carried out overnight. 
Post-hybridization, membranes were washed twice 15 
minutes with 2x SSC/0.1% SDS at room temperature and 
twice 15 minutes with O.lx SSC/0.1% SDS at 68°C. 

25 Detection was performed using the DIG Wash and Block 
Buffer Set as described by the manufacturer 
(Boehringer Mannheim Mannheim) and the blot was 
exposed to Kodak XAR-5 film for 1 hour at ambient 
temperature . 

30 

Real time quantitation of mRNA transcript 
PCR quantitations using specific primers and probes 
were performed according to the TaqMan procedure 
(Livak et al., 1995; Orlando et al., 1998) using the 
35 ABI Prism 7700 sequence detector (Applied Biosystems) . 
Primers and probes for ACT1 (b-actin) and RNR1 genes 
were designed using the PrimerExpress software system 
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(Perkin Elmer Cetus) . Cells were grown to OD 600 ~ 1.0 
and total RNA was prepared using the RNeasy midi kit 
(Qiagen) according to the manufacturer's instructions. 
All RNA samples were DNasel (Boehringer-Mannheim, 
5 RNAse-free) -treated at 20 U/pg in 50 pi solution for 
40 min at ambient temperature, phenol/chloroform- 
extracted and precipitated. Pellets were dissolved in 
20 ml MilliQ water (Millipore) and RNA concentrations 
were determined spectrophoto-metrically . First-strand 

10 cDNA synthesis was performed in a final volume of 20 
pi containing lx Superscript RT buffer (Life 
Technologies) , 10 mM DTT, 125 pM of each dNTP, 50 pM 
hexamer primers (Life Technologies) and 1 mg RNA. 
Mixtures were incubated for 10 min. at ambient 

15 temperature and 1 pi was removed and diluted 1:4 for 
the non-amplification control (NAC) ; 20 U Superscript 
reverse transcriptase (Life Technologies) was added 
and the reaction was incubated for 1 hour at 42 °C. 
The enzyme was inactivated for 10 min at 70°C. PGR 

20 reactions were set up in triplicate for all genes and 
contained 5 ml PCR buffer A, 4 mM MgCl 2 , 200 pM each 
of dATP, dGTP, dCTP and 400 pM dUTP, 250 nM 
fluorogenic probe (for RNR1 : 5' TGA-TCT-CAA-AAA-GTG- 
CTG-GAG-GAA-TCG-GT 3')/ 0.5 U UNG, 1.25 U AmpliTaq 

25 Gold, 16.75 ml H 2 0, 300 nM of appropriate FORWARD (for 
RNR1: 5' CGA-CAC-TTT-GAA-ATC-GTG-TGC-T 3') and REVERSE 
(for RNR1: 5' GCA-CCG-GTA-GAA-CGA-ATG-TTG 3') PCR 
primers, 1 ml of the RT reaction mixture. 
For the NAC, 1 pi of the 1:4 diluted RTase-negative 

30 sample was added while 1 pi of H 2 0 was added to each 
non-template control sample. The ABI PRISM 7700 was 
run for 50 cycles of 15 s at 95°C, 1 min at 60°C. 
These cycles were preceded by 5 min at 50 °C (UNG 
activation) and 10 min at 95°C (UNG inactivation and 

35 DNA denaturation) . 

Data were analyzed using the ABI PRISM 7700 software 
package. Data were normalized according to ACT1 C T 
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values . 

Library screening 

Using primers SpGalNistPCR and 3pGalNist PCR, a 0.6 kb 
5 region of the C. albicans SAM2 . gene was PCR-amplif ied 
from a SAM2/pGALlpNiST-l construct isolated from clone 
36 and labeled with [ 32 P]dCTP using the Multiprime™ 
random-primed labeling system (Amersham) . C. albicans 
genomic DNA isolated from strain B2630 was partially 

10 digested with Sau3AI, resolved on a 0.7% agarose gel 
and the region of the gel with the fragment size of 
interest (10-23kb) was cut out and DNA was eluted from 
the gel with Sephaglass Band Prep, kit (Pharmacia). A 
C. albicans library in pYCP50 was prepared by ligating 

15 these fragments into a BamHI cut and dephosphorylated 
pYCPSO vector in a 1:2 molar ratio vector to insert. 
The titer ( #colonies/pg DNA) was determined by 
transforming a fraction of the library to E. coli. 
Five genome equivalents were plated out and filter- 

20 lifts were prepared as described (Sambrook et al., 

1989) . Duplicate nylon filters were pre-washed for 2 
hours at 42°C in 50 mM Tris, 1M NaCl, 0.1% SDS, 1 mM 
EDTA to reduce background hybridization. The filters 
were subsequently hybridized at 42 °C overnight in 5x 

25 SSPE, 50% formamide, 5x Denhardt's solution, 0.1% SDS, 
100 pg/ml denatured salmon sperm DNA and 10 6 cpm/ml of 
denatured probe. Filters were then washed in 2x SSC, 
0.5 % SDS for 1 hour at room temperature and for 1 
hour at 50°C. A few intense autoradiographic spots 

30 were found and the corresponding colonies were 

selected for plasmid preparation. Candidate clones 
were digested with a panel of restriction enzymes, 
resolved on a 0 . 7 % agarose gel, stained with 
ethidiumbromide and transferred to nylon membrane by 

35 vacuum-blotting. The blot was probed under the same 
conditions as the genomic library. A 1.1 kb Hpal 
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fragment covering the entire hybridizing segment was 
subcloned into pCR-Blunt (Invitrogen) 

Screening for compounds modulating expression of 
5 polypeptides critical for growth and survival of C. 
albicans 

The method proposed is based on observations 
(Sandbaken et al., 1990; Hinnebusch and Liebman 1991; 
Ribogene PCT WO 95/11969, 1995) suggesting that 

10 underexpression or overexpression of any component of 
a process (e.g. translation) could lead to altered 
sensitivity to an inhibitor of a relevant step in that 
process. Such an inhibitor should be more potent 
against a cell limited by a deficiency in the 

15 macromolecule catalysing that step and/or less potent 
against a cell containing an excess of that 
macromolecule, as compared to the wild type (WT) cell. 

Mutant yeast strains, for example, have shown 
that some steps of translation are sensitive to the 

20 stoichiometry of macromolecules involved. (Sandbaken 
et al.). Such strains are more sensitive to compounds 
which specifically perturb translation (by acting on a 
component that participates in translation) but are 
equally sensitive to compounds with other mechanisms 

25 of action. 

This method thus not only provides a means to 
identify whether a test compound perturbs a certain 
process but also an indication of the site at which it 
exerts its effect. The component which is present in 

30 altered form or amount in a cell whose growth is 

affected by a test compound is potentially the site of 
action of the test compound. 

The assay to be set up involves measurement of 
growth of an isogenic strain which has been modified 

35 only in a certain specific allele, relative to a wild 
type (WT) C. albicans strain, in the presence of R- 



- 31 - 



compounds. Strains can be ones in which the 
expression of a specific essential protein is impaired 
upon induction of anti-sense or strains which carry 
disruptions in an essential gene. An in silico 
5 approach to finding novel essential genes in C. 

albicans will be performed. A number of essential 
genes identified in this way will be disrupted (in one 
allele) and the resulting strains can be used for 
comparative growth screening. 

10 

Assay for High Throughput screening for drugs 
35 minimal medium (S medium + 2% galactose + 
2% maltose) is transferred in a transparent flat- 
bottomed 96 well plate using an automated pipetting 

15 system (Multidrop, Labsystems) . A 96-channel pipettor 
(Hydra, Robbins Scientific) transfers 2.5 /il of R- 
compound at 10" 3 M in DMSO from a stock plate into the 
assay plate. 

The selected C. albicans strains (mutant and 

20 parent (CAI-4) strain) are stored as glycerol stocks 
(15%) at -70°C. The strains are streaked out on 
selective plates (SD medium) and incubated for two 
days at 30°C. For the parent strain, CAI-4, the medium 
is always supplemented with 20 /zg/ml uridine. A single 

25 colony is scooped up and resuspended in 1 ml minimal 
medium (S medium + 2% galactose + 2% maltose) . Cells 
are incubated at 30°C for 8 hours while shaking at 250 
rpm. A 10 ml culture is inoculated at 250.000 
cells/ml. Cultures are incubated at 30°C for 24 hours 

30 while shaking at 250 rpm. Cells are counted in Coulter 
counter and the final culture (S medium + 2% galactose 
+ 2% maltose) is inoculated at 20.000 to 50.000 
cells/ml. Cultures are grown at 30°C while shaking at 
250 rpm until a final PD of 0.24 (+/- 0.04) 6nM is 

35 reached. 

200 //l of this yeast suspension is added to all 
wells of MW96 plates containing R-compounds in a 450 
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/il total volume- MW96 plates are incubated (static) at 
30°C for 48 hours. 

Optical densities are measured after 48 hours. 

Test growth is expressed as a percentage of 
5 positive control growth for both mutant (x) and wild 
type (Y) strains. The ratio (x/y) of these derived 
variables is calculated. 

Results 

10 

A C. albicans integrative vector, pGALlPSiST-1, was 
constructed to allow non-directional cloning of C. 
albicans genomic DNA fragments (Figure 2) . The vector 
contains an inducible GAL1 promoter, a Sfil-cloned 

15 stuff er fragment, a C. albicans URA3 selection marker 
and elements to allow autonomous replication and 
selection in E coli. A C. albicans genomic DNA library 
was prepared by ligating small genomic DNA fragments 
(400 to 1000 bp) which were linked to Sfil adaptors into 

20 the Sfil opened vector pGALlPSiST-1 vector. Genomic DNA 
fragments (450 ng) were ligated into the pGALlPSiST-1 
vector (20 ng) . After electroporation into E. coli 
approximately 400,000 clones were obtained. Plasmid DNA 
was prepared of ... clones; 91% contained an insert with 

25 an average length of 600 bp. The size of the library 
corresponds to over 5 times the diploid genome with 
genomic DNA inserts oriented in sense or antisense 
direction in the vector. 

A similar C. albicans integrative vector, 

30 pGALlPNiST-1, was constructed to allow Sfil/Not I 
directional cloning of C. albicans cDNA fragments 
(Figure 2) . The Sfil/NotI cDNA was purified and sized 
on a Biogel column A150M. The first fraction contained 
approximately 38,720 clones upon transformation to E. 

35 coli with an average insert size of 1500 bp. cDNA from 
this fraction was ligated into a Notl/Sfil opened 
pGALlPNiST-1 vector. 
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C. albicans strain CAI-4 was transformed with the 
aforementioned genomic and cDNA libraries. Upon 
homologous recombination between the insert (partial or 
complete gene) in a library clone and the corresponding 
5 gene in the Candida genome, this gene is (partially if 
the gene is not full-length) duplicated (Figure 1) . The 
first copy of the gene is flanked upstream by its native 
promoter and downstream by the GAL1 promoter. The 
direction of transcription from the native promoter is 

10 opposite to that of the GAL1 promoter. Induction of the 
GAL1 promoter might thus lead to altered expression of 
the gene at the integration site- Moreover, if the cDNA 
does not contain the entire 5' coding region, the first 
copy of the gene may not give rise to any more to a 

15 functional protein. The second copy of this gene has 
lost its promoter and will therefore not be transcribed 
(Figure 1) . 

Upon integration at the site of the GAL1 promoter, 
the promoter is duplicated yielding an integrated gene 

20 fragment under control of the GAL1 promoter (Figure 1) . 

Growth curves were measured in the presence of 
lithium acetate. Figure 3 shows growth curves of the 
wild type CAI-4 strain and transf ormants -resulting from 
cDNA library transformation- showing either an overall 

25 impaired growth (clone 38; Figure 3C) or galactose- 
induced (clone 36; Figure 3B) reduction in growth. This 
analysis was performed in S-glucose medium as a non- 
induction medium and S-galactose medium as an induction 
medium. The results shown in Figure 3A show that also 

30 the wild type strain shows reduced growth in antisense 
induction medium. This is because galactose is used 
rather inefficiently as a carbon source by C. albicans. 
In order to solve this problem and facilitate the 
selection procedure an extra carbon source, maltose, was 

35 added to both inducing and non-inducing medium. Again 
growth patterns varied significantly from transformant 
to transformant but growth of the parental strain CAI-4 



- 34 - 



was nearly identical in both media (Figure 3D) . Strains 
impaired in growth upon promoter activation showed a 
clear shift in the growth curve in medium supplemented 
with both galactose and maltose (clone 415; Figure 3E) . 
5 Overall impaired growth was, as expected, not strongly 
influenced by the addition of maltose (clone 360; Figure 
3F) . 

To verify that maltose as an extra carbon source 
did not affect the strength and inducibility of the GAL1 

10 promoter, promoter activity was measured using 
Kluyveromycfcs lactis LAC 4 reporter gene expression. CAI- 
4 was transformed with LAC4/pGALlpNiST-l . Four 
individual transf ormants (named Q, R, S, T) were grown 
in glucose, galactose, glucose+maltose and 

15 galactose+maltose media and B-galactosidase activity was 
measured (Figure 4) . It is clear that the presence of 
maltose does not significantly influence the induction 
ratio of the GAL1 promoter. 

From a total of over 2000 transf ormants screened, 

20 198 (-10%) showed an impaired growth phenotype and were 
selected for further analysis. Fourty-three % of these 
slow growers showed a growth pattern corresponding with 
a putative promoter interference or antisense effect, 
57% showed overall impaired growth. PCR analysis with 

25 SpGALNiSTPCR and 3pGALNiSTPCR primers on genomic DNA 
from the transf ormants can reveal integration outside 
the gene showing sequence identity with the insert DNA, 
eg. at the GAL1 promoter region . (Figure 1) . Of all 
transf ormants screened by PCR using these primers, 

30 ~ 11% showed integration at a non-insert location. 

When the insert of an antisense library clone 
recombines with the homologous gene in the C. albicans 
genome, no PCR product can be obtained upon 
amplification with SpGALNiSTPCR and 3pGALNiSTPCR primers 

35 on genomic DNA (Figure 1) . To release the plasmid from 
the genome and determine the integration site, genomic 
DNA was isolated from the transf ormants , cut (with SacI 
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for the genomic library t ransf ormants and with PstI for 
the cDNA library transf ormants ) , religated and the 
resulting ligation reaction was precipitated and used 
as a template for inverse PCR. This procedure reveals 
5 homologous integration at the insert site as well as the 
number of integrations (assuming PCR products are of 
different lengths) within the Candida genome. This 
analysis was performed on all selected transf ormants, 
-32 % of which showed multiple integrations. The 

10 frequency of multiple integrations was very variable and 
depended on' the batch of transf ormants analyzed. 
The resulting PCR products from both analyses were 
subsequently sequenced and the sequences by compared 
with both public and proprietary sequence databases. In 

15 total 86 different genes could be identified, 45 of 
which were of unknown function. 

For the CAI-4 transf ormants obtained with a genomic 
(non-directionally cloned) library, 26% of the selected 
clones (n=~150) contained the C. albicans autonomous 

20 replicating sequence, ARS2 , and 15% of the clones 
contained a ribosomal RNA fragment. 

For the CAI-4 transf ormants obtained with a cDNA 
library (n=-1850) a whole series of different gene 
fragments was found. As expected, also a number of genes 

25 involved in carbon source metabolism and nutrient uptake 
were identified. 

Two examples of identified genes will be discussed, 
although as seen in Figures 16 to 70 similiar results 
were obtained for all of the sequences according to the 

30 invention. Clone 36 shows a galactose-induced impairment 
in growth, suggestive of a promoter interference or 
antisense effect (Figure 3B) . In this clone 
recombination had occurred at the insert site as shown 
by amplification of a -600bp gene fragment by inverse 

35 PCR. The sequence of the isolated gene fragment was 74 

% identical to a S. cerevisiae S-adenosyl methionine 
synthetase 2 (SAM2) gene. Effects on SAM 2 mRNA were 
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assessed by Northern blots on total RNA extracted from 
a non-transformed control strain and from clone 36 grown 
either in ant isense-inducing or non-inducing media. The 
Northern blot was hybridised with an in vitro 
5 synthesized SAM2 RNA sense probe to detect antisense 
transcripts (Figure 5). An identical Northern blot was 
hybridised with an in vitro synthesized SAM2 antisense 
probe to detect SAM2 mRNA (Figure 5) . Both blots were 
subsequently hybridized with a TEF3 DNA probe to allow 

10 normalization. As the sequence of the C. albicans SAM2 
gene was not available at the time, a C. albicans 
genomic library in pYCpSO was prepared and E. coli 
trans formants were screened for the full-length gene 
using the 600 bp SAM2 PCR fragment as a probe. A 

15 strongly hybridizing clone was identified and designated 
clone 36.13.1. This clone contained the complete ORF 
(1155 bp) of the SAM2 gene including 5' and 3' flanking 
regions. In the very A/T-rich 5' flanking region a 
putative TATA box could be identified at -27 bp. The 3' 

20 flanking region contains multiple T-rich (>10 bp) 
regions, elements described in yeast as necessary for 
transcript release (Reeder and Lang, 1997) . The complete 
SAM2 mRNA transcript thus has a predicted length of 1.2 
kb. 

25 Clone 38 showed impaired growth in both non- 

inducing and inducing media (Figure 3) ; this is expected 
when integration of the library plasmid itself leads to 
gene suppression. Clone 38 contained a 340 bp fragment 
of the ribonucleotide reductase 1 (RNR1) gene. RNR1 mRNA 

30 levels were analysed by Northern blot and quantitative 
PCR in a non-transformed control strain and clone 38 
grown in S+glucose medium. The Northern blot was 
hybridised successively with an actin and an RNR1 
doublestranded DNA probe (Figure 6). Although the li- 

35 actin transcript level in the control strain is lower 
compared to clone 38, the relative amount of RNR1 
transcript is higher, indicating a reduced level of RNR1 
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transcript in clone 38. This result was confirmed by 
Taqman quantitative PCR on both control strain and clone 
38 using a RNR1 fluorogenic probe. Resulting Ct values 
were calculated and normalised for B-actin (Figure 6) . 
5 Again RNR1 transcript levels in clone 38 were found 
reduced compared to the control strain. 

To verify that the growth-effect was due to the 
interference with the identified gene and to support the 
spcificity of the antisense effect, single allele knock- 

10 outs were made in 6 identified genes using the URA- 
blaster method (Fonzi and irwin, 1993) . Disruption of 
one allele of a gene should in theory lead to - 50 % 
reduction in gene transcript. In practice however we 
have observed reductions varying between 10 and 100 % 

15 of normal level. This can probably be explained by the 
fact that not always both copies of a gene are 
functional. That only a single integration at the 
corerct site had occurred for each of the disruption 
cassettes was verified by PCR and Southern blot 

20 analysis. Growth curves were measured; three disruptants 
showed impaired growth, suggesting that a gene required 
for growth or survival was targeted. Experiments to take 
over control of the second allele of each gene -by 
promoter replacement- are ongoing. 

25 The present application describes new methods to 

diminish endogenous gene expression in the medically 
important yeast C. albicans. Our approach proved very 
useful for the identification of genes required for 
growth or survival. Technical hurdles consisted of the 

30 lack of an efficient transformation method for C. 

albicans (Logghe M., submitted) and the need to measure 
growth reproducibly on a large number of transf ormants 
in parallel. The latter was achieved with a Bioscreen 
Analyzer (Labsystems) which can simultaneously measure 

35 growth in 200 cultures and subsequently generate growth 
curves automatically. Although in principle this kind 
of screening could be done on plates we could not 



- 38 - 



achieve satisfactory reproducibility using plate 
screening . 

In our genomic screen, integration of the library 
plasmid can happen either at the endogenous GAL1 
5 promoter locus or, more frequently, at the locus 
corresponding to the plasmid insert. The latter results 
in a gene duplication with the first copy of the gene 
flanked by two convergently oriented promoters- The use 
of such a "collision construct" has previously been 

10 described in screening for inhibitors of transcriptional 
activation in mammalian cells (patent WO 97/10360; Giese 
K.) . If RNA polymerase II complexes start from both the 
upstream and downstream, oppositely oriented, promoter 
regions, they may collide thereby preventing the 

15 formation of a full-length mRNA transcript. The second 
copy of the gene has no more a promoter and is probably 
5' crippled as the original inserts cloned into the 
library have an average length of -1.5 kb while ORFs in 
C. albicans have an average length of ... and we ourselves 

20 identified ORFs of (unknown) genes larger than 7 kb. 

Upon integration of a plasmid into the C. albicans 
genome, reduced function of the protein encoded by the 
disrupted gene can be due to several mechanisms: 1) The 
first copy of the duplicated gene can be prevented from 

25 forming functional sense transcript by promoter 
collision or the sense transcript may be inhibited by 
true antisense . Indeed, although a 1.2 kb SAM2 
antisense transcript could be detected in clone 36 we 
cannot exclude the growth defect being due to promoter 

30 interference. If an antisense transcript is formed from 
an intact SAM2 gene, we expect a transcript of at least 
1055 bp; no transcription terminator was engineered 
upstream of this gene so transcription will proceed 
until an appropriate transcription termination 

35 recognition site is reached. The promoter region of the 
SAM2 gene is particularly A/T rich and contains a 
reversed yeast transcription terminator site at - 118 
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(with translation starting at +1). In yeast, 
transcription terminator sites are ill-defined but for 
a T-rich stretch with non-T residues situated 
appropriately to prevent slippage (Jeong et al., 1996; 
5 Reeder and Lang, 1997). If termination of transcription 
occurs at this theoretically predicted site, a 1.17 kb 
transcript would be expected, as was found. 2) If 
mutations were present in the original library clone, 
the protein encoded by the gene after homologous 

10 recombination could be non-functional. 3) Possible cis 
down-regulatory effects on adjacent genes could be 
induced upon integration of large DNA fragments at 
certain locations within the genome. 4) Finally, gene 
disruption could occur by recombination with cDNA that 

15 is not full-length at the 5' end. 

If -on the contrary- integration happens at the 
endogenous GAL1 promoter site, the GAL1 promoter is 
duplicated and antisense can be induced from this 
promoter. Promoter collision is not possible as the 

20 endogenous promoter of the gene is lacking at the 
integration site. Integration at a non-homologous site 
within the genome is rare. Transformation efficiencies 
of 0.7-3 transf ormants/pg have been reported upon 
transformation of CAI-4 with non-homologous plasmid DNA 

25 (Thompson et al., 1998). Also, integration at the URA3 

locus is very unlikely as the complete URA3 gene has 
been removed from the CAI-4 genome. 

Irrespective of the mechanism responsible for gene 
suppression, we could identify genes required for growth 

30 or survival by screening for transf ormants showing 
either galactose-induced or complete growth block. 
Furthermore, for such genome-wide screening no prior 
sequence information is needed and it allows discovery 
of possibly new critical functions. However, some genes 

35 may only be critical under conditions different from 
growth in minimal medium (as used in our screening) e.g. 
environments with high oxygen tension, high osmolarity 
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or high pH - However, it would be possible to screen for 
a growth phenotype under these conditions using our 
screening method. Alternatively, some genes may play 
critical roles only under unusual growth states or may 
5 specifically be required for adaptation to conditions 
encountered during infection of a host. 

More than half of the ORFs we have identified as 
being critical for growth have a completely unknown 
function- Even though required for growth in C. 

10 albicans, for some genes no homologues could be found 
in existing 'databases, suggesting that they are species- 
specific genes. Indeed, recent genome sequencing efforts 
(e.g. Mycoplasma genitalium (Fraser et al., 1995), 
Haemophilus influenzae (Fleischmann et al, 1995)) have 

15 shown that approximately 20 % of the predicted ORFs in 
a microbial genome can be species-specific. 

One disadvantage of the technique is that multiple 
library plasmids can integrate in the genome of a single 
C. albicans cell. When this occurs, identification of 

20 the target responsible for the growth defect becomes 
more difficult. Also, as shown in Schizosaccharomyces 
pombe (Atkins et al., 1995), RNA-mediated suppression 
may not be effective for certain genes, which we would 
miss when relying only on this technique. 

25 Rather unexpectedly, transformation with the 

genomic library and subsequent screening for 
transf ormants showing reduced growth frequently yielded 
ARS2- and rRNA-containing clones (in 26 and 15% 
respectively of the transf ormants showing reduced 

30 growth) . Previously, a study of aging yeast mother cells 
had shown that accumulation of extrachromosomal rDNA 
circles (ERCs) occurs in old cells and that these ERCs 
actually cause aging (Sinclair et al., 1997; Johnson et 
al., 1999). rDNA is present at 100-200 tandem copies on 

35 chromosome XII of S. cerevisiae and was found to 
accumulate to about 1000 copies in senescent cells. 
One other gene we identified is a homologue of the 
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essential S. cerevisiae gene TRA1, a protein kinase 
showing highest identity to the human TRRAP gene 
(McMahon et al. , 1998) which is an ataxia telangiectasia 
mutated (ATM) -related gene. Loss of ATM is a genetic 
5 defect identified in ataxia telangiectasia (Johnson et 
al., 1999), a disease in humans where life span is 
typically reduced to 40-50 years. We might thus have 
picked up a number of growth-inhibitory effects due to 
induction of aging. 

10 The strategy presented should be applicable to all 

organisms for which existing techniques for "en masse" 
gene disruption are not easily applicable because of 
their diploid nature and lack of sexual cycle and might 
prove especially useful for other diploid imperfect 

15 - yeasts. 

Although the genomic strategy that we described 
cannot substitute for a comprehensive investigation of 
individual genes and pathways, it can provide a good 
starting point for such investigation. 
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TABLE 1 



Sea ID No. Clone Function 



1 214c_cpL1 

2 113g2 

3 222g8 

4 222g8(prt) 

5 222g9 

6 222g9_CDS_1 

7 • 222g9_CDS_2 

8 222g9_CDS_3 

9 222g9_CDS_4 

10 24gG 

1 1 28gK 

12 328c1 

13 328c1(prt) 

14 33gK 

15 33gK(prt) 

16 3gG 

17 58gA 

18 21g2 

19 21g2(prt) 5' UTR TRA1 

20 223c_Cp CFL 

21 357cL 

22 357cL(prt) RPL27 

23 110c_af 

24 110c_af(prt) SADH 

25 CDC48 

26 CDC48(prt) CDC48 

27 99g3 

28 99g3(prt) CIT 

29 ESP1 

30 ESP1(prt) ESP1 

31 190g3 

32 190g3(prt) FAL1 

33 249c_af 

34 249c_af(prt) MAA 

35 485cL 

36 485cL(prt) RPL16 
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